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The effect of weak gravitational lensing on the cosmic microwave background (CMB) temperature 
anisotropics and polarization will provide access to cosmological information that cannot be obtained 
from the primary anisotropics alone. We compare the information content of the lensed B-mode 
polarization power spectrum, properly accounting for the non-Gaussian correlations between the 
power on different scales, with that of the unlensed CMB fields and the lensing potential. The 
latter represent the products of an (idealised) optimal analysis that exploits the lens-induced non- 
Gaussianity to reconstruct the fields. Compressing the non-Gaussian lensed CMB into power spectra 
is wasteful and leaves a tight degeneracy between the equation of state of dark energy and neutrino 
mass that is much stronger than in the more optimal analysis. Despite this, a power spectrum 
analysis will be a useful first step in analysing future B-mode polarization data. For this reason, we 
also consider how to extract accurate parameter constraints from the lensed B-mode power spectrum. 
We show with simulations that for cosmic-variance-limited measurements of the lensed B-mode 
power, including the non-Gaussian correlations in existing likelihood approximations gives biased 
parameter results. We develop a more refined likelihood approximation that performs significantly 
better. This new approximation should also be of more general interest in the wider context of 
parameter estimation from Gaussian CMB data. 



I. INTRODUCTION 

The goal of many current and forthcoming cosmic mi- 
crowave background (CMB) experiments is to measure 
the polarization of the CMB with increasing accuracy 
(e.g. QUaD [1], BICEP [2], EBEX [3], QUIET 1 and 
Clover [4]). There are several motivations for this: po- 
larization measurements are an important test of the 
consistency of the cosmological model, as well contain- 
ing new information, and can break parameter degen- 
eracies which are present if only the CMB temperature 
power spectrum is analysed [5]. A key motivation is the 
fact that the curl-like B-mode polarization is not gener- 
ated directly from scalar perturbations at last scattering, 
and thus can potentially reveal the presence of a primor- 
dial gravitational wave background, if it exists at a high 
enough level [6, 7]. A detection of primordial large-angle 
B-mode polarization would thus provide a measure of the 
expansion rate and hence energy scale of inflation. Apart 
from the reionization bump around t ~ 5, which depends 
on the optical depth to last scattering, the gravitational 
wave signal peaks at multipolcs t ~ 100, corresponding 
to the angle subtended by the horizon size at last scat- 
tering, and so large area surveys (a few hundred deg 2 ) 
will be required to detect it. On smaller scales, the dom- 
inant contribution to B-mode polarization is expected to 
be the weak lensing of B-modcs by large scale structure 
along the line of sight [8]. 

The lensing signal is sensitive to the gravitational po- 
tential, and hence the clustering of matter, mostly at 
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redshifts < 10 [9]. (At the peak of the lensing deflection 
power spectrum at multipoles i ~ 40, more than 95% of 
the power arises from z < 10.) Lensing is therefore sensi- 
tive to parameters that have little direct effect on the pat- 
tern of primordial fluctuations, but affect the late-time 
growth of large-scale structure. The lensing signal can 
thus provide us with information which would otherwise 
be absent from the CMB [10, 11], such as the properties of 
dark energy and sub-eV neutrino masses [12]. However, 
the lensed B-modes are significantly non-Gaussian, and 
this has been shown to degrade markedly the amount of 
information present in the B-mode power spectrum [13]. 
The non-Gaussianity introduced by weak lensing is also 
present to a much lesser extent in the CMB temperature 
anisotropies and B-mode polarization signal. It has re- 
cently been shown that neglecting this non-Gaussianity 
does not significantly bias parameter constraints from 
Planck-quality data, although the effect of weak lensing 
on the power spectra cannot be neglected [14] . 

In the present paper we consider more fully the ques- 
tion of what parameter information is contained in the 
lensed B-mode power spectrum and how to analyse fu- 
ture spectral data reliably. We concentrate on the B- 
mode spectrum here for the following reason. The ex- 
pectation is that for future experiments with sufficient 
signal-to-noise to image the lens-induced B-modcs, most 
of the additional parameter information from the lens- 
ing effect on the CMB power spectra will come from the 
B-modes. Although the information content of the B- 
mode power spectrum is relatively more affected by the 
non-Gaussianity than the temperature (T) and B-mode 
polarization, that latter must contend with the cosmic 
variance of the dominant primary (unlensed) contribu- 
tions. It is well known that for Gaussian fields the ob- 
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served power spectrum is a sufficient statistic for param- 
eter estimation under ideal survey conditions, but that 
this is not the case for non-Gaussian fields such as the 
lensed CMB. Compressing the lensed CMB fields to their 
power spectra is wasteful leading to a loss of cosmological 
information. A more optimal analysis would be to use the 
non-Gaussianity to reconstruct an estimate of the lcnsing 
deflection field and the unlenscd CMB fields [15, 16], or 
to work directly from the correct non-Gaussian likelihood 
function [17, 18]. While such analyses are a worthy goal 
to strive for, they will likely be difficult to implement 
in practice in the presence of real-world complications 
such as inhomogeneous noise, complex survey geometries 
and foreground residuals. A simpler, and probably more 
robust method to deal with near-future lensed data is 
to work directly with the lensed fields in a conventional 
power spectrum analysis. 

This paper is organised as follows. In Section II we re- 
view how -B-mode polarization is generated by the weak 
lcnsing of -E-modes, and consider the cosmological pa- 
rameters which can be constrained using this informa- 
tion, and the degeneracies between them. In Section III 
wc estimate these parameters from simulated lensed data 
under the assumption that the data are Gaussian, in or- 
der to show directly that this leads to false conclusions. 
In order to account for the non-Gaussianity we need to 
understand the correlation between the measured power 
on different scales, and we calculate this in Section IV 
following Ref. [13]. In Section V we incorporate the non- 
Gaussian covariancc in existing likelihoods used for the 
analysis of (Gaussian) CMB data but show that this too 
leads to biased results in simulations. The reason for this 
deficiency appears to arise from inaccuracies in the cur- 
rent likelihood approximations when applied to data with 
few degrees of freedom (which extends to higher multi- 
poles for the lensed S-modes than for Gaussian fields). 
We introduce a more accurate likelihood function in Sec- 
tion VI which we show to perform much better on simu- 
lated lensed data. An appendix provides further details 
of the derivation of this new likelihood. 



II. COSMOLOGICAL PARAMETERS 

The action of lensing on the part of the temperature 
anisotropy and polarization that is generated at recom- 
bination is a simple re-mapping that in the flat-sky ap- 
proximation can be written as 

X(n) = X(n + V0). (1) 

Here X(n) is the unlenscd variable (T, or the Stokes pa- 
rameters Q and U), X(n) the lensed value and <fi the 
lensing deflection field. On the sphere, we need to take 
care to interpret correctly the meaning of V</) [19] and 
to take into account the position-dependence of the co- 
ordinate basis for polarization. The root-mean-square 
lcnsing deflection angle ~ 3 arcmin and so the lcnsing ac- 
tion on CMB fields well above this scale can be evaluated 




FIG. 1: (Colour online) Unlensed EE power spectrum (red 
solid line) and power spectrum of lensing potential (blue dot- 
ted line) which together generate the lensed BB power spec- 
trum (shown by green dashed line, multiplied by a factor of 
400). 

with the gradient approximation: 

X(n) w X(n) + VI ■ V0. (2) 

Within this approximation, and in the fiat-sky limit, we 
can express the lensed E-mode power spectrum as a con- 
volution of the unlensed E-mode power spectrum with 
the power spectrum of the lensing potential <fi [13]: 

Cf B = J 0fCpC^W\l,a (3) 

where function W is given by 

W (£,£') =£'■(£- £') sin 2(& - ), (4) 

with £ = (£cos(j)£,£sm(j)g). This is illustrated in Fig. 1, 
which shows the unlensed E-mode and lensing poten- 
tial power spectra, together with the resulting lensed B- 
mode power spectrum. The power spectra were calcu- 
lated using CAMB 2 which includes corrections for curved 
sky effects and the breakdown of the gradient approxi- 
mation [20]. 

The lensed E-mode power spectrum is thus affected 
by cosmological parameters which affect either the pri- 
mary (unlenscd) CMB power spectrum Cf E or the lens- 
ing power Cf^. Many of these parameters can be well 
constrained by the temperature and E-modc power spec- 
tra, and the lens-induced E-modes do not contribute sig- 
nificantly to improving these constraints. Here we con- 
centrate on those parameters that are degenerate with 
respect to the unlensed CMB spectra. As is well known, 
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the unlcnsed spectra can provide tight constraints on the 
physical densities in baryons and cold dark matter, the 
angular diameter distance to last scattering, the primor- 
dial power spectrum, and the optical depth to reioniza- 
tion [21]. Neutrinos with masses below ~ 0.3 eV, as im- 
plied by current analyses of large-scale structure and the 
Ly-cv forest [22], are relativistic at recombination and the 
only effect of their mass on the unlensed CMB is via the 
angular diameter distance, d,A, and a small large-scale 
contribution to the late integrated Sachs- Wolfe (ISW) 
effect [23]. Similarly, dark energy parameters, such as 
the current energy density fl^h 2 , and equation of state 
w are felt only through <1a and the ISW effect. In the 
inflation-inspired flat models considered here, the param- 
eters n^h 2 , w, and the current density in massive neutri- 
nos fl u h 2 (or equivalcntly the neutrino masses) are highly 
degenerate with respect to the unlensed CMB, even for 
cosmic-variance-limited observations. They are collec- 
tively constrained only by d^, which is now accurately 
determined to be 13.7±0.5 Gpc from the CMB alone [24]. 
Adding curvature or evolution in the dark-energy sector 
increases the dimensionality of the degeneracy. 

While this 'geometric' degeneracy can be easily broken 
with the inclusion of external data [21], it is still worth- 
while to consider to what extent the lensing of the CMB 
helps given its relatively simple and well-understood 
physics. An early analysis of the temperature and E- 
mode polarization power spectra showed that the weak 
lensing effect on the power spectra can go some way to 
breaking the geometric degeneracy [11] since the late- 
time evolution of the gravitational potential responds 
differently to variations in the parameters that are ge- 
ometrically degenerate. Since for multipoles I > 100 the 
-B-mode power spectrum is expected to be dominated 
by weak lensing, this is potentially a much more sensi- 
tive probe of the lensing effect than the temperature or 
-E-mode power since there is no cosmic variance coming 
from unlensed -B-mode polarization. For a given sur- 
vey, once the sensitivity reaches the limit for imaging the 
lens-induced B-modes (better than 5 /iK-arcmin) we can 
expect these to be most constraining. To reach the ulti- 
mate cosmic- variance limit considered later in this paper, 
we require sensitivities around 1 ^K-arcmin over a large 
fraction of the sky. This is similar to the specifications 
being discussed for a post-Planck orbital experiment ded- 
icated to CMB polarization 3 . 

In Fig. 2 we show the derivatives of the lensing poten- 
tial and B-mode power spectra with respect to w and 
Q, v h 2 in a flat universe with the physical baryon and 
cold dark matter densities held fixed at f^/i 2 = 0.024 
and Q, c h 2 = 0.111. The derivatives are taken around 
w = —1 (a cosmological constant) and Q, v h 2 = 0.004 
with the ratio 0d of the sound horizon at last scatter- 
ing to the angular diameter distance held fixed. If we 




FIG. 2: (Colour online) Derivatives of the lensing power spec- 
trum l^Cf^ (top) and the lensed -B-mode power spectrum 
£(£ + l)Cf S /(2tt) (bottom), keeping 6 D fixed (see text). Red 
solid line: derivatives with respect to fi„/i 2 ; blue dashed line: 
derivatives with respect to w (amplified by a factor of 350 in 
both plots). 



parameterise flat models by 8d, w and fl„h 2 , both the 
Hubble parameter h and fl\h 2 are derived parameters. 
Our choice of parametcrisation is motivated by the geo- 
metric degeneracy: the unlensed CMB accurately deter- 
mines all our parameters expect w and fl^h 2 which are 
themselves very poorly constrained in the absence of ex- 
ternal information. The power spectra used to compute 
these derivatives were computed with the input unlensed 
power spectra set to zero above the maximum multipole 
value f max = 2048 to be consistent with the analyses of 
simulations presented in the following sections of this pa- 
per. We have assumed three families of neutrinos with 
equal masses. Since m v = Q v h 2 x 94 eV when all fami- 
lies are non-relativistic , the mass in the fiducial model is 
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neutrino masses tend to zero (for three families of leptons). It 
is a good approximation to take the neutrino mass of a family 
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0.13 eV. The total mass is close to the current best limits 
from cosmological probes [22]. Given the measurements 
of (squared) mass differences from atmospheric and solar 
neutrino experiments, our value of Q, v h 2 is just at the 
limit where we can assume mass degeneracy [25]. We 
note also that we have not included any non-linear cor- 
rections to the matter power spectrum in computing the 
spectra in Fig. 2 since the HALOFIT [26] fitting employed 
in CAMB does not currently support massive neutrinos or 
models with w ^ — 1. Non-linearities increase the lens- 
ing power spectrum beyond I ~ 300, and have an effect 
> 5% on all scales for the B-mode spectrum, and a much 
larger effect on small scales [20]. Non-linearities should 
thus properly be included in any future data analysis. 

The effect of changes in w and VL v h 2 on the lensing 
power spectrum has been discussed in Ref. [27]. The 
main effect of changes of w is through the change in the 
expansion rate. An increase in w at fixed 9d (or, equiva- 
lcntly, (Ia) requires a reduction in the current dark-energy 
density and hence h. However, increasing w causes dark 
energy to dominate earlier and the net effect is an initial 
enhancement of the expansion rate over that for w = — 1; 
this causes the gravitational potential to decay earlier 
and suppresses the lensing effect. Hence the derivative 
of the lensing potential power spectrum with respect to 
w is negative; see Fig. 2. The effect on the potential 
is almost independent of scale, but its time dependence 
causes a larger fractional change in lensing power on large 
scales. If we fix the current physical density in dark 
energy, the effect of massive neutrinos on the gravita- 
tional potential vanishes on large scales (larger than the 
Jeans length of the neutrinos when they first become non- 
relativistic) since the increased expansion rate is offset by 
the neutrino clustering. The effect on the lensing power 
spectrum thus also vanishes at large I, On scales below 
the neutrino Jeans length today the neutrinos have never 
clustered and their mass gives a scale- independent sup- 
pression of the gravitational potential similar to the effect 
of increasing w. On intermediate scales, there is a scale- 
dependent suppression of the lensing power. The addi- 
tional effect of keeping 0q fixed is to subtract off a small 
proportion of the suppression that arises when Q\h is in- 
creased so that neutrino mass then gives a small positive 
enhancement of lensing power at low £ and a suppres- 
sion for larger £ (Fig. 2). The different scale dependence 
of the effects of neutrino mass and w allows them to be 
separated if the lensing potential can be accurately deter- 
mined. In this way one can determine both w and Q, v h 2 
accurately, whereas neither (nor any combination) can 
be determined accurately from the unlensed CMB alone. 
Forecasts for such constraints were given in Ref. [27], 



to be proportional to its contribution to fi„/i 2 provided that 
m„ > 0.004 eV. Since in the November 2004 release of CAMB 
that we use the (degenerate) masses are calculated as ^ m v = 
Cl v h 2 X 94 eV, it is important to avoid calculating derivatives 
with respect to Q v h 2 around Q v h 2 = 0. 
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FIG. 3: (Colour online) Kernels Cy&Cf B /dCf,"' for I = 10 
(red), I = 500 (blue) and I = 1000 (green). These give (ap- 
proximately) the contribution to the lensed B-mode spectrum 
from lenses at scale £' . 



which used for the errors on Cf those expected from an 
application of the quadratic reconstruction technique of 
Ref. [16]. This exploits the local scale-scale correlations 
in the non-Gaussian lensed CMB fields to reconstruct a 
(noisy) estimate of the deflection field. Improvements in 
the reconstruction of the deflection field may be possi- 
ble with more optimal techniques [18]. Reference [27] 
found that a post-Planck experiment could determine w 
to within an error of 0.18 and detect the mass of a single 
family of massive neutrinos if m„ > 0.04 eV. 

Figure 2 shows that w and VL^h 2 are largely degen- 
erate with respect to their effect on the i?-modc power 
spectrum. This is because for no I is the dominant con- 
tribution to Cf B coming from the largest angle lenses 
for which there is a clear difference between the effects 
of varying w and £l v h 2 \ for i > 200, the effect of varying 
w or Q, v h 2 is almost degenerate in the lensing potential 
power spectrum and hence in Cf B . In Fig. 3 we plot the 
derivatives Cf?&C BB /dCf? as a function of £' for several 
values of £. For any £, the integral of the kernel over £' 
gives the lensed B-mode power spectrum at that £ (in the 
gradient approximation, Eq. 1). Similarly, for the param- 
eter variations considered here that leave the unlensed 
-E-mode power spectrum unchanged, the convolution of 
the kernel Cf?ACf B /dCf? with the fractional change in 

Cf? gives the change in the lensed B-mode spectrum. 
The shape of the kernels in Fig. 3 can be understood as 
follows. For £ < 200, the dominant contribution to the 
integral in Eq. (3) comes from £' £ since the i?-modcs 
have very little power at the large scale £. In this limit, 
the lensed -B-mode spectrum is approximately white and 
the kernel reduces to 

Ctf^Xr-r « —£' 5 Cf?C BE , (5) 

1 dCf 4tt 1 e 
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for I <C £'■ For I close to 1000, around the peak of the 
.E-mode spectrum, the dominant contribution to Cf B is 
from larger scale lenses and the kernel is of the form 



r«P< 



AC, 



BB 



1 



- nib rt>t>4> i r< 
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for I > £'. Here, {Cf E )e is a smoothed version of Cf E 
where the smoothing is with a bimodal kernel of total 
width 21'. If t! <C 200 the smoothing has little effect and 
{Cf E ) t ra C EE . Finally, for I > 5000 (not shown in 
Fig. 2), the -B-mode power arises from lenses at the same 
scale across which the CMB may be approximated by a 
gradient [28]. In this limit, the kernel is roughly 
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where the factor in brackets is the mean-squared gradient 
of the polarization. 

Since w and f2„/i 2 are essentially degenerate in the B- 
mode power spectrum, the consequence of this is that, 
if we use only the lensed -B-mode power spectrum and 
do not consider higher-order statistics, there is a tight 
degeneracy between the two parameters. This is still an 
improvement over what can be achieved from the un- 
lcnsed CMB spectra alone which give no constraints on 
cither parameter, but clearly falls short of what would be 
achieved if the lensing potential could be reconstructed 
directly. While this is a clear motivation for developing 
further such reconstruction techniques, for the reasons 
mentioned in Sec. I, it will still be worthwhile to perform 
an initial power-spectrum-based analysis. Furthermore, 
external data on e.g. the Hubble parameter, will assist 
in breaking the degeneracy between w and Vl v h 2 in a 
power-spectrum-only analysis. For this reason, we now 
consider in more detail how to analyse future B-mode 
power spectrum data taking careful account of the de- 
pendencies between power on different scales. 



III. ANALYSIS ASSUMING GAUSSIANITY 

We begin by illustrating the effect that non- 
Gaussianity has on estimated parameter constraints by 
(wrongly) analysing simulated lensed maps with the like- 
lihood function appropriate for Gaussian fields. This ex- 
tends the work of Ref. [13] who showed that the Fisher 
estimate of the error on the amplitude of the lensed B- 
mode spectrum is under-estimated if Gaussianity is as- 
sumed. 

We used the publicly-available LensPix 5 code [14] 
(which is based on a modified version of HEALPix 1.2 6 ) to 
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create simulations of the lensed CMB on the full sky. So 
that the effects of non-Gaussianity could be clearly seen 
we considered the idealised full-sky case with no noise or 
foreground sources. The lensed map was calculated di- 
rectly at the pixel centres of the deflection map from the 
spherical multipolcs of the unlcnscd polarization. This 
avoids introducing any unwanted bias due to the effects 
of pixelization error which may arise when using the al- 
ternative interpolation method in the LensPix code, but 
is much slower and limited the number of simulations 
that we could produce. The sky was generated using the 
HEALPix resolution parameter iV s ide = 1024 (giving 3.4- 
arcmin pixels) with l max = 2048 to avoid aliasing. The 
theoretical power spectra used in the analysis were cal- 
culated also using £ max = 2048. This leads to a lensed 
B-mode power spectrum that lacks power particularly on 
smaller scales, but our analysis is self-consistent. 

We restricted the analysis to constraining only the val- 
ues of w and the tensor/scalar ratio r, within a flat dark- 
energy CDM cosmology with no massive neutrinos. The 
strong degeneracy between w and Q, v h 2 means that it 
was not useful to vary both parameters, and omitting 
massive neutrinos significantly speeds up the computa- 
tion of the theoretical power spectra. The models used 
had scale-invariant curvature fluctuations (n s = 1) with 
no running. The spectral index of tensor fluctuations was 
set to n t = — r/8, in accordance with predictions of slow- 
roll inflation (see, e.g. Ref. [29] and references therein). 
The values of the physical baryon and CDM densities, 
flbh 2 and f2 c /i 2 , were held fixed along with 6*£>, so that 
f^A and h are derived parameters. Two fiducial mod- 
els were used, both with w = — 1, but the second with 
a much lower value of r to see if the non-Gaussianity 
had a significant effect on the estimated values of r. 
Model A had a high value of r = 0.4 (close to current 
upper limits [22]) with tt b h 2 = 0.024, tt c h 2 = 0.111, 
Hq = 0.66 km s^ 1 Mpc -1 , the amplitude of scalar fluctu- 
ations on 0.05 Mpc -1 scales A s = 2.6 x 10 -9 and optical 
depth r = 0.2. Model B used r = 0.01, fl b h 2 = 0.0234, 
n c h 2 = 0.111, Ho = 73 km s- 1 Mpc" 1 , A s = 2.3 x 10" 9 
and t = 0.14. (The value of A s e~ 2r was the same for 
both models.) 

We extracted the measured power spectrum Cf B of 
the simulations and constrained r and w with the like- 
lihood appropriate to full-sky, noise-free observations of 
Gaussian fields: 



H_i «£i j (2£+l)C e 



P(C\C)ocY[C e 2 C t 2 exp 



2Ce 



(8) 



where Ci is the theoretical power spectrum. The nor- 
malisation of the likelihood function with respect to Cf 
is usually dropped since in practice it is a fixed quantity 
when constraining parameters. We assumed flat priors 
on the values of w and r. 

Figure 4 shows the parameter constraints obtained 
from three Model-A simulations and seven Model-B sim- 
ulations using the likelihood in Eq. (8). It can be seen 
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FIG. 4: Parameter constraints obtained from full-sky lensed 
simulations using the B-mode power spectrum only and as- 
suming Gaussianity. Contours are shown at the 67%, 95% 
and 99% confidence limits. Top: Model-A simulations. Bot- 
tom: Model-B simulations. The parameters in the fiducial 
model are marked with a cross. 



that the constraints arc, in most cases, inconsistent with 
the input value of w = — 1. However, the marginalised 
constraints on r are in accordance with the input val- 
ues. There appears to be a bias towards low values of w, 
and the spread of the maximum-likelihood values indi- 
cates that the constraints on w are too tight. The latter 
is consistent with the findings of the Fisher analysis in 
Ref. [13]. 



IV. NON-GAUSSIAN COVARIANCE 

Equation (3) shows that the power in lensed S-modes 
on any particular scale arises from the power on a range 
of scales in E and <f>. As can be seen in Fig. 1, the power 
in the lensing deflection field peaks at large scales, mean- 
ing that a significant fraction of the power in B-modes on 
scales around the peak is generated by large-scale lenses 
(see also Fig. 3). A particular mode in the lensing deflec- 



tion field will lens many different _E-modes to generate 
B-modes on a range of scales. The power in these B- 
modes will be related to the power in the single mode 
of the deflection field and so the measured power in B- 
modes is significantly correlated between scales [13]. In 
order to account for this non-Gaussianity in our analysis, 
we need to calculate the covariance between the measured 
Cf B . The covariance can be expressed as a sum of the 
Gaussian and non-Gaussian parts: 



Cav(C?.» ,Of B ) = S,, 



: NG 



(9) 



where the Gaussian part arises from the non-connected 
part of the £>-mode four-point function and on the full 
sky is given by 



2(Cf B ) 2 
21+1 



9° - <5 •• 



(10) 



Lensing has only a very small effect on any primordial 
-B-modes from gravitational waves so we can treat these 
as an independent Gaussian field; their power spectrum 
then only enters the Gaussian part of the covariance ma- 
trix. Similar comments would apply to Gaussian instru- 
ment noise. 

We can calculate the non-Gaussian part of the covari- 
ance matrix on the flat sky using the gradient approxima- 
tion Eq. (2). The gradient approximation is not suitable 
for accurate calculation of the lensed power spectra [14] . 
In addition, the flat-sky approximation introduces per- 
cent level errors in the lensed B-mode spectrum on all 
scales [20]. Fortunately, the errors from these approx- 
imations tend to cancel above I ~ 400 and the lensed 
B-mode spectrum can be computed to ~ 1% accuracy 
on such scales using Eq. (3). We anticipate that the non- 
Gaussian components of the covariance matrix computed 
with the gradient and flat-sky approximations will have 
a similar level of accuracy. 

We thus calculate the non-Gaussian part of the covari- 
ance matrix within the flat-sky approximation using the 
bandpower expression given in Ref. [13]: 
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}, (11) 



where A is the area of sky observed, = J £ei d 2 £i and 
similarly for ctj, and W is given by Eq. (4). This ex- 
pression assumes periodic boundary conditions both to 
remove the complications due to E-B mixing and the ad- 
ditional correlations between the observed Fourier modes 
due to the geometry. For surveys of a few- hundred deg 2 , 
the former should be negligible except on the largest 
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FIG. 5: (Colour online) Illustration of the non-Gaussian com- 
ponent of covariance matrix Cov((7f^ s , Cf 2 B ). The maximum 
value at the peak is 5 x 10 _16 /xK 4 but the scale has been set to 
illustrate the structure in the rest of the matrix more clearly. 



scales, and the latter can be dealt with by choosing wide 
enough bands. Neither of these is an issue for the full-sky 
observations we consider here, and in this case we can set 
A = 4-7T and use bands of width A£ = 1. This means that 
we can simplify this integral somewhat, since for a fixed 
length of ti = li and £j = £j the value of the integrand is 
dependent only on the angle between £i and £j and their 
moduli. Therefore we can take out the integral over £j 
and instead fix £j = , 0). We can also simplify the 
integral over £i to a one-dimensional integral in 8. We 
can summarise this as 
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— I d 2 £ t 
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(12) 



Figure 5 illustrates the structure of the non-Gaussian 
component of the covariance matrix as calculated for 
Model A. It can be seen that the correlation is present 
across all scales and is not confined to the region near 
the diagonal. Although the non-Gaussian covariance be- 
tween any two Cf B is small compared to the Gaussian 
covariance of the diagonal elements (~ 3 x 10~ 12 /iK 4 /<? 
for I < 300), the fact that the non-Gaussian covariance 
spans a wide range of scales means that its net effect is 
very significant. 

It is computationally expensive (taking several hours of 
CPU time) to re-calculate the covariance matrix for each 
set of model parameters as would be required in param- 
eter estimation. However, when considering variations 
in those parameters that are not well constrained by the 
primary anisotropies (w and f2„/i 2 here), the main effect 
on the lensed B-mode power spectrum is an overall scal- 
ing in amplitude. For this reason, we can approximate 
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FIG. 6: (Colour online) 95% confidence limits from a Fisher 
analysis. Top: Model A. Bottom: constraints for a model with 
same parameters as Model A except r = 0.005 and fl„/i 2 = 
0.004 (which we also allow to vary in the analysis). Blue 
dashed contours: limits if the lensed B-mode power spectrum 
is used, with the correct covariance matrix. Green solid line 
(top plot only): limits using the B-mode power spectrum 
and incorrectly assuming the data are Gaussian. Red filled 
area: theoretical limits if all unlensed power spectra are used 
including the power spectrum of the lensing deflection field. 



the covariance as 



Si 



S G + — 



Clj Q NG,fid 



(13) 



where Cg is the lensing contribution to the i?-mode power 
spectrum and Cf d is the same in the fiducial model. This 
is similar to the scheme suggested in Ref. [13]. 

We can calculate the theoretically-optimum con- 
straints on parameters 9i by performing a Fisher anal- 
ysis. For Gaussian fields, the Fisher information matrix 
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reduces exactly to [30] 

where X and Y can be TT, EE, TE or EE and 
Cov~ (Cf-,Cj) is the inverse covariance matrix of the 
various measured power spectra. If we wish to consider 
the information that can be obtained from the lensed B- 
mode power spectrum, we need to compute the Fisher 
matrix - {8 2 InP/ '86,86 where P = P{Cf B \0). If we 
can approximate this as Gaussian in the measured power 
spectrum, then provided the number of independent de- 
grees of freedom is much larger than unity 7 we find 

where Cf B is the total E-mode power spectrum. 

The tightest parameter constraints achievable under 
any circumstances from CMB data alone are those ob- 
tained from the power spectra of the unlensed, Gaus- 
sian, temperature and polarization fields together with 
the lensing deflection field. Some approximation to this 
situation may be achievable using the full likelihood func- 
tion for the lensed CMB fields [17, 18], or, with larger 
errors, from quadratic reconstruction techniques [15, 16]. 
In the top plot of Fig. 6 we show the constraints obtained 
from a Fisher analysis of Model A, if all parameter val- 
ues are fixed except r and w. We consider three cases: 
using only the lensed E-mode spectrum with the correct 
covariance matrix; the same but (wrongly) using only the 
Gaussian part of the covariance matrix; and using all of 
the unlensed CMB fields and the lensing deflection field 
(treated as Gaussian). In all cases we only kept modes up 
to I = 1200. In the latter case, we include the unlensed 
CMB temperature and electric polarization though these 
have very little impact since the parameters we allow to 
vary are chosen to be nearly degenerate with respect to 
these fields. If the non-Gaussianity is neglected the con- 
straints on w from the E-mode power spectrum alone are 
slightly tighter than those obtainable using all of the un- 
lensed fields, which shows immediately that we are mak- 
ing a false assumption [12]. It can be seen that the er- 
rors on r are almost unaffected by the non-Gaussianity 
whereas the errors on uu are significantly affected. How- 
ever the presence of the lensed E-modes does significantly 
increase the errors on r [31, 32], so if the gravitational 
wave amplitude is small (r < 0.01) it will be desirable to 
attempt reconstruction of the unlensed CMB in order to 
detect the signal. 

The degeneracy between w and f2„/i 2 that is present 
when only the B-mode power spectrum is used can be 



7 It is in just this limit of a large number of degrees of freedom 
that the likelihood will approach a Gaussian. 



clearly seen in the bottom plot of Fig. 6. Here, r, w and 
Q, v h 2 are allowed to vary around a fiducial model with 
the same parameters as Model A except for a smaller 
r = 0.005 and massive neutrinos with Sl^/i 2 = 0.004. 
It should be recognised that a Fisher analysis, based on 
the derivatives of the power spectra around the fiducial 
model values, is good at highlighting degeneracies but 
does not give accurate error contours for degenerate pa- 
rameters [23]. The actual degeneracy between w and 
il„/i 2 follows a curve rather than a straight line. The 
degeneracy is broken by using the power spectrum of the 
lensing deflection field, although even then the variables 
are still correlated to a lesser extent. In the unlensed, 
foreground-free ideal case, the errors on r are propor- 
tional to r and so there is no lower limit to the value of 
r that can be detected. In practice foreground residuals 
(and other systematic effects) are expected to pose a sig- 
nificant challenge and may well be the factor that limits 
the constraints that can ultimately be achieved. 



V. NON-GAUSSIAN LIKELIHOOD 

Having shown clearly that the non-Gaussianity of 
lensed E-modcs cannot be ignored, we now address the 
question of how it may correctly be taken into account 
during parameter estimation from measured power spec- 
tra. To do this, we need to employ a likelihood function 
which includes information about the dependencies be- 
tween the power on different scales. In principle, the 
exact likelihood can be found from that for the lensed 
fields [17, 18], but this would be very complicated, and 
such an analysis would probably be no simpler than an 
optimal analysis with the fields themselves (which would 
avoid the lossy compression to power spectra). Instead 
we proceed heuristically, modelling the likelihood as a 
function of the lensed spectrum and its covariance only. 
This necessarily assumes that the connected moments 
of the lensed field can be ignored above the four-point 
level, or, more correctly, that they can be approximated 
in a hierarchical manner by the lower moments. We only 
consider noise-free analysis on the full sky here, or, in 
Sec. VI, on a periodic flat sky. Since our approach is 
based on the measured power spectrum, the additional 
(geometric) complications from the survey geometry and 
those from instrument noise can be dealt with by stan- 
dard techniques (see e.g. [33]). 

A suitable likelihood function must reduce to a good 
approximation to the true likelihood in the limit that 
the non-Gaussian covariance tends to zero, Eq. (8). Note 
that it is the dependence of this function on the cosmo- 
logical parameters that is of interest. The coupling of 
power between different scales by non-Gaussianity has a 
similar effect to the geometric coupling between scales 
that arises in observations of Gaussian fields over only 
part of the sky. A number of approximate likelihoods 
have been suggested for this latter problem, and a sim- 
ple strategy for analysing lensed E-mode spectra might 
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be to replace the (geometric) covariance matrix in these 
approximations by the non-Gaussian covariance matrix. 

A possible first approximation, a likelihood function 
that is Gaussian in Ci, is known to produce biased pa- 
rameter constraints from Gaussian fields [34]. A better 
approximation is the log-normal distribution in which the 
likelihood is Gaussian in the log of the power. This was 
originally developed for the case where the peak of the 
likelihood (considered as a function of the theory Ci 8 ) 
and its curvature there are known [34], but it can eas- 
ily be tailored to our problem by replacing the modal 
Cg by the measured Cg, and the (inverse) curvature by 
the theoretical covariance matrix. (This likelihood was 
denoted by £^n m R- CI - P5], with the prime distinguish- 
ing it from the original formulation in Ref . [34] ; we make 
the same distinction in Eq. 16 below.) The log-normal 
likelihood has been employed in the analysis of several 
different CMB data sets, e.g. [36-40]. However, this like- 
lihood is also biased; the level of bias is acceptable for 
many data sets but becomes significant when using full- 
sky data with a low noise level. The first-year WMAP 
data were analysed using a likelihood function which is 
a weighted combination of the Gaussian and log-normal 
likelihoods [35]: 

lnP(C|0) = ilnP Gauss (C|0) + |lnP^ N (C|0). (16) 

This is a significantly better approximation to the exact 
likelihood in the limit of no non-Gaussianity; see Fig. 10 
of Appendix A for comparisons. 

To illustrate the level of bias of the different likeli- 
hood functions when estimating r and w, we created 
a Gaussian simulation of the CMB using the lensed B- 
mode power spectrum, and analysed the resulting mea- 
sured power spectrum. The results are shown in Fig. 7. 
It can be seen that the constraints obtained when the 
WMAP likelihood function is employed are virtually in- 
distinguishable from those obtained using the exact like- 
lihood given by Eq. (8), although the Gaussian and log- 
normal approximations do lead to significant differences. 

Figure 8 shows the marginalised distribution of w ob- 
tained when the WMAP likelihood function is used to 
estimate parameters from the ten lensing simulations of 
Fig. 4. The non-Gaussian covariance matrix was included 
with the prescription in Eq. (13). Although from the 
spread of the maximum-likelihood values it seems that 
the width of the distributions are a good reflection of 
the uncertainties in the parameters, there is a significant 
bias towards low values of w, which means that, for most 
of the simulations, the parameter estimates are inconsis- 
tent with the input values. The marginalised distribu- 
tions of r are very similar to those obtained when Eq. 



For Gaussian fields, the parameters only enter the likelihood 
through the power spectrum. 
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FIG. 7: (Colour online) Likelihood contours at 67, 95 and 
99% confidence limits from analysis of a Gaussian simulation 
of the CMB made using the lensed power spectrum. Red 
solid contours: using the exact likelihood function. Green 
dotted contours: using the Gaussian approximation to the 
likelihood function, showing the significant difference from the 
exact likelihood. Light blue dot-dashed contours: using the 
log-normal distribution, which gives a smaller, and opposite, 
shift. Dark-blue dashed contours: using the WMAP likeli- 
hood function, showing almost indistinguishable results from 
the exact likelihood. The fiducial model values are marked 
with a cross. 



(8) was applied to the same simulations. This indicates 
that the non-Gaussian contribution to the covariance is 
less significant on large scales, and hence the presence 
of non-Gaussianity does not introduce a bias, or affect 
the errors, on r. This is understandable since the signal 
which contains the information about r is Gaussian to a 
high level. 



VI. NEW LIKELIHOOD FUNCTION 

In the previous section we showed that none of the 
existing likelihood approximations in the literature ap- 
pear suitable for the analysis of future cosmic-variance 
limited, lensed £?-mode data. For this reason we have re- 
examined the issue of likelihood approximations in CMB 
analysis; our findings can be found in Appendix A. The 
main result is a new likelihood function that appears 
to out-perform existing approximations. Here we sum- 
marise this new likelihood function and demonstrate that 
it produces accurate parameter constraints on simulated 
lensed data. 

To begin, it is worth first reviewing the underlying is- 
sues that arise when constraining parameters from non- 
Gaussian fields. For ideal, full-sky observations, in the 
absence of non-Gaussianity, the probability of the ob- 
served fields d given the set of cosmological parameters 
can be written as a function of the theoretical power 
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FIG. 8: Marginalised distributions of w obtained for three 
Model-A simulations (top) and seven Model-B simulations 
(bottom), using the WMAP likelihood function. 

spectrum Ci(9) and the measured power spectrum Cf. 

P(d\9) = P(d\C e ) = f(dt)P(d t \C £ ), (17) 

where f(Ce) is a function of the measured power spec- 
trum only. It follows that Ct are sufficient statistics for 
parameter estimation and hence there is no loss of infor- 
mation in compressing the data to the measured power 
spectrum. For non-Gaussian fields, even for ideal ob- 
servations, the measured power is no longer a sufficient 
statistic and the likelihood may not just be a function of 
the theoretical power spectra of the fields. Nevertheless, 
we can regard the measured power spectrum as a form of 
lossy compression of the data, and the relevant likelihood 
is then the sampling distribution P(Ci\6). 

In Appendix A we derive a new likelihood by approx- 
imating the properly- normalised distribution P(Ci\9) as 
Gaussian in some function of the Ce . The likelihood takes 
the form 

\nP(6i\0)n}nA-±^Mu}(xt-iii)(xt>-iJit>), (18) 

w 



where 

xi = C\ lz (19) 

/2/-1 \ 1/3 
« - (^C) , (20, 

and 

(21) 

where Sw is the covariance matrix of the measured Ce 
at parameters 8. The normalisation is 

A^ 1 oc y/dctM ee ,Y[f4, (22) 

l 

which we approximate here by A oc Y[g Note that 

the normalisation depends on the cosmological parame- 
ters and including this dependence is important to get 
an accurate approximation for the 0-dcpcndence of the 
likelihood in the tails. 

In Appendix A we show that this new likelihood is 
considerably more accurate than other approximations in 
current use when tested on full-sky, Gaussian data when 
the number of degrees of freedom is low (i.e. low I). When 
applied to Gaussian simulations, we find parameter con- 
straints very similar to those shown for the WMAP (and 
exact) likelihood functions in Fig. 7. However, the new 
likelihood produces very different results when applied to 
non-Gaussian lensed simulations with the non-Gaussian 
covariance employed, as shown in Fig. 9. It can be seen 
here that for all three Modcl-A simulations the parame- 
ters constraints obtained are consistent with the values 
used in the simulations, unlike those obtained when the 
WMAP likelihood function is used. The results from 
the Modcl-B simulations arc similar. In all cases the 
marginalised constraints on r are nearly identical to those 
obtained when Gaussianity is assumed, which indicates 
that the non-Gaussianity does not noticeably affect the 
estimated value of r, although the presence of the lensed 
power spectrum itself does influence the bounds we can 
place on r. The parameter constraints are consistent with 
the theoretical errors obtained from the Fisher analysis 
shown in Fig. 6. 

In order to test the new likelihood function further, 
we performed flat-sky lensed simulations, which can be 
computed much more rapidly than the full-sky simula- 
tions. It was found that, with a pixel size of 1.3 arcmin, 
accurate power spectra could be obtained by interpola- 
tion provided that a cubic interpolation method was used 
(a simple linear interpolation resulted in an inaccurate 
lensed power spectrum). Periodic boundary conditions 
were specified. The amplitude of the lensed -B-mode 
power spectrum does not vary linearly with w, and for 
measured power spectra with large uncertainties, this re- 
sults in a distribution of w which is highly skewed and 
stretches far into the region w < — 1. To avoid this, we 
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FIG. 9: (Colour online) A comparison of the parameter constraints obtained for the three Model-A simulations, using different 
likelihood functions, with contours shown at 67, 95 and 99% confidence limits. Red dot-dash contours: constraints obtained 
assuming Gaussianity. Green dashed contours: using WMAP likelihood function. Blue solid contours: using new likelihood 
function. The fiducial model value is shown with a cross. 



used maps which were 91.7° across to reduce the sample 
variance. It should be noted that for this size of map ap- 
proximating the sky as flat is clearly incorrect; the statis- 
tics of the lensed modes in the simulations will differ at 
the per-cent level from the spherical expectations on all 
scales [20] , and more so for those modes approaching the 
survey size. We performed 150 of these simulations, with 
Model-A parameters. 

The estimated power spectrum of the flat-sky maps is 
calculated as bandpowers: 



IE 



A 2tt£ 



(23) 



lei 



where B(l) is the measured -B-mode, the sum is over val- 
ues of i that lie in band i, and, recall, A is the area of 
the sky. In the limit that (. 2 Ci/(2tt) is constant within 
each band, we find that (%) = PC 1 J{2tt). The WMAP 
likelihood function can straightforwardly be generalised 
to work with bandpowers. For the new likelihood func- 
tion we need to drop the (21 — l)/(2£ + 1) factors; they 
are more significant at low-f where the flat-sky approxi- 
mation does not hold anyway. The bandpower version of 
the new likelihood function can then be expressed as 
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where the Gaussian part of the bandpower covariancc 
matrix b is given by 
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and the non-Gaussian part can be calculated from the 
non-Gaussian part of the the full-sky covariancc matrix 
as: 
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Since the posterior distribution of w is skewed, as men- 
tioned above, the maximum-likelihood value is greater 
than the mean value. Using the WMAP likelihood func- 
tion, both the mean and maximum-likelihood values of w 
were significantly biased, with values (averaged over the 
simulations) of -1.075±0.009 and -1.044±0.007 respec- 
tively. The quoted errors are standard errors in the mean 
of 150 simulations, and were estimated from the simula- 
tions. For both the mean and maximum-likelihood value, 
the bias is comparable to the random error on the mea- 
sured value; the situation would worsen if we considered a 
larger survey area as the random error would fall but the 
bias would remain. Using the new likelihood, the mean 
of w was —0.992 ± 0.007 and the maximum likelihood 
value — 1.017 ± 0.007, showing that this likelihood func- 
tion performs significantly better. A comparison between 
the width of the marginalised distributions of w and the 
spread of the mean values estimated from each of the flat- 
sky simulations gave good agreement, to around one per 
cent. This shows that the estimated errors on the value 
of w are correct once the non- Gaussian covariance is in- 
cluded. There was a slight positive bias in the mean of r 
of about 0.5 per cent for the new likelihood and around 
twice this for the WMAP likelihood. This is about 30% 
of the random error on r for the sample- variance limited 
observations considered here, but we expect that the bias 
is an artifact of the flat-sky simulations on large scales. 



VII. CONCLUSIONS 

The generation of B-mode polarization by weak gravi- 
tational lensing provides a way of measuring cosmological 
parameters from the CMB alone that would otherwise 
be poorly constrained. We showed that a power spec- 
trum analysis is wasteful, and in particular suffers from 
a strong degeneracy between the equation of state of the 
dark energy and neutrino masses. A more optimal analy- 
sis that reconstructs the lensing deflection field can break 
this degeneracy [27], as can external data. Nevertheless, 
compression into the measured power spectrum will be 
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a useful first step in the analysis of future _B-mode data 
as it is likely to be more robust against real-world effects 
than more optimal reconstruction techniques. With this 
in mind, we have shown that it will be essential to take 
the non-Gaussianity of the lens-induced S-modes into 
account in a high signal-to-noise power-spectrum analy- 
sis, not only to avoid under-estimation of errors [13] but 
also to remove biases in the parameters that affect lens- 
ing; constraints on the gravitational-wave amplitude arc 
not noticeably affected by non-Gaussianity. Including the 
non-Gaussian covariance of the measured B-mode power 
spectrum in existing likelihood functions that have been 
used for parameter estimation from CMB spectra was 
shown to give biased results on noise-free simulations of 
lenscd CMB fields. To remedy this we developed a new 
likelihood function that is more accurate in the tails of 
the distribution when the number of independent modes 
is low. Due to the non-Gaussian nature of CMB lensing, 
the number of such independent modes below a given 
multipole is lower than for Gaussian fields. We verified 



on simulations that the new likelihood function performs 
much better than existing approximations for the par- 
ticular application considered here. However, we expect 
that it will be more generally applicable for the accurate 
analysis of the spectra of Gaussian CMB fields on large 
scales. 
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APPENDIX A: LIKELIHOOD 
APPROXIMATIONS 

In this appendix we derive the likelihood approxima- 
tion employed in Sec. VI. Our strategy for dealing with 
the non-Gaussianity of lens-induced -B-mode data in this 
paper is heuristic: we include the non-Gaussian correla- 
tion between power on different scales but our choice of 
likelihood is motivated by the analysis of Gaussian fields. 
For this reason, our new likelihood approximation should 
also be useful more generally in CMB analysis. We only 
consider observations of a single field here {Cf); we plan 
to address the more general problem of joint analysis of 
correlated fields in a future publication. 

We develop two approximations motivated by the two 
ways that CMB power spectra are usually obtained. In 
the first, we work with the Gaussian CMB field d di- 
rectly, and aim to characterise the probability P(d\Cg) 
as a function of the theoretical power spectrum Cg that 
encodes all of the cosmological information 0. Exploring 
P(d\Ci) is computationally expensive so often the modal 
value of Cg and the curvature (or a Fisher approximation 
to it) are found, e.g. Ref. [42], and used in an analytic 
approximation to the GVdependcncc of P(d\Cg). Com- 
plications due to, for example, the survey geometry, are 
accounted for only through their impact on the curvature 
matrix and modal Cg . We construct an approximate like- 
lihood by looking for variables xgiCg) in which the exact 
likelihood is accurately represented by a Gaussian, i.e. 
we approximate the CV-dcpcndcncc of lnP(d|C^) as 



\nP(d\Cg) w ~^M u }{x t - fj,t)(x e > - n e >) 



(Al) 



up to an irrelevant constant. If this approximation is 
to peak at the correct place, we require [ig = xg(Cg }ln \), 
where Cg, m \ is the modal (or maximum-likelihood) value 
of Cg. Similarly, if the curvature at the peak is —J-u.'-, we 
must have 



m; 



dCg dCV 
dxg dxg> 



(A2) 



where the derivatives are evaluated at the peak. To con- 
strain the variable change xg(Cg), we examine the third 
derivatives of In P(d\Cg) with respect to the xg: 



d 3 InP 

dxgdxg'dxgn 

5p 



SffSf 



d\nPd 3 C f 



dCg dx 3 
d 2 \nP d 2 CgdCgn 



dCgdCpji dx 2 dx£" 

d 3 lnP dCgdC 



+ cyclic perms 
dCt» 



dCgdCg>dCg» dxg dxg' dxg" 



(A3) 



where have imposed that xg is only a function of Cg at 
the same t. Setting the right-hand side exactly equal 
to zero is, in general, inconsistent. Instead, we obtain 
a tractable and consistent problem if we demand that it 
vanish everywhere when we replace the derivatives of P 
on the right with their expectation values evaluated with 
the likelihood appropriate to full-sky, noise-free observa- 
tions [34]: 

\nP(d\Cg) = - Y,{1 + l/2){C e /d + InCg), (A4) 



where Cg is the measured Cg. Evaluating the expectation 
values of the derivatives of this likelihood (over data from 
an ensemble with power Cg), we find from Eq. (A3) that 
we must have 



A(dCg/dxgf = 3Cgd 2 Cg/dxl 



(A5) 



Note that the full-sky number of degrees of freedom 2^+1 
present in the derivatives cancels in this equation so we 
expect this procedure to remain valid for observations 
covering only a fraction / s k y of the sky. Equation (A5) 

— 1/3 

is solved by xg = C g , and gives an approximation for 
the CVdependence of h\P(d\Cg) of the form 



lnP(d|C £ 



4/3 T ,,^4/3 



U> 



-1/3 



1/3 



r--i/3 

W',ml 



)],(A6) 



up to an irrelevant constant. The essential difference 
with the log-normal approximation developed in Ref. [34] 
is that they demand that the expectation of the curva- 
ture of the likelihood be constant (when approximated 
by Eq. A4) whereas we force the expectation value of 
the derivative of the curvature to be zero. These are 
not equivalent since the latter preserves the distinction 
between Cg }Ta \ and Cg until after the derivative of the 
curvature is taken while the former docs not. 
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To include instrument noise, we repeat the above steps 
but now include isotropic noise in the full-sky likelihood 
in Eq. (A4). The effect in our approximate likelihood, 
Eq. (A6), is to add a noise offset Nt to Ce,mi and Ce 
in the right-hand side. Following Ref. [34], we suggest 
determining Ne by fitting the model 



(21 + l)/ sky 



(C/,mi + N,f 



(A7) 



to the diagonal elements of the inverse curvature at the 
peak. 

The second approximation we consider is that which is 
used in Sec. VI. In this case, we first compress the data 
down to a measured power spectrum Ce and then ask 
what is the sampling distribution for the Ce given the 
cosmological model 91 We then use the ^-dependence 
of this probability as the likelihood in parameter estima- 
tion [43]. For noise- free observations of Gaussian fields 
over the full sky, the sampling distribution is given by 
Eq. (8) which on taking logs becomes 



\nP(C e \Ce) 



E 



/2<?+lC £ 2£+l 
V 2 C~e + 2 

2£-l 



InCe 



lnC e 



(A8) 



up to a constant that is independent of Ce and the the- 
oretical spectrum Ce . This approach is more natural for 
the non-Gaussian problem considered in the body of this 
paper since there we perform (lossy) compression down 
to Cf B , and the power spectrum of the lensed I?-modcs 
does not fully characterise their statistics. We now look 
for a variable transformation xe(Ce) in which the sam- 
pling distribution P(Ce\9) is approximately Gaussian: 

]nP(Ce\e)^lnA-lj2 M uH^-^)(^-^e), (A9) 



where A is a Cf-indcpendcnt (but cosmology-dependent) 
normalisation. We assume that we can calculate the 
mean (Ce) and covariance See 1 of the Ce given the cos- 
mological model. Here we shall assume that Ce is con- 
structed to be unbiased so (Ce) = Ce although this as- 
sumption can easily be dropped. Since the mean and co- 
variance of P{Ce\Q) are specified rather than the modal 
value of Ce and the curvature there, we postpone their 
determination until we have determined the functional 
form of xe- This is obtained by examining the third 
derivatives of the sampling distribution with respect to 
Ce- These are given by Eq. (A3) but with Ce and xe 
replacing Ci and xe- Again we approximate the deriva- 
tives with respect to Ce, but this time with those derived 
from the distribution in Eq. (A8). The second and third 
derivatives are independent of Ce and we make no further 
approximation with these. We ignore the first derivative 
term since it is small near the peak, so, on setting the 



(approximate) third derivative of InP with respect to xe 
to zero, we find 



2(dQ/d^) 2 = 3Ced 2 Ce/dx 2 e . 



(A10) 



' 1 /3 

This is solved by xe — C e . We could now determine 
Hi and Mw by demanding that the approximate distri- 
bution gives the correct mean and covariance for Ce- A 
simpler method, which we adopt here, is to relate fie and 
Mee 1 to the (known) mean and covariance using Eq. (A8) 
as a guide. For that ideal distribution, the peak is at 
[(21 - \)/ (21 + l))(Ce) and the curvature at the peak 
is related to (minus) the inverse variance by a factor 
[(2£ + l)/(2l - 1)]. This motivates setting 



M7 f } 



2£-l 
21+1 



1/3 



21 + 1 dCe 
21- 1 ~d± t 



^ DO' 



21' + 1 dCV 



2C 



1 die' 



(All) 



(A12) 



where the derivatives are taken at the peak where xe = 
lie. If we use this to approximate the exact result 
Eq. (A8) and evaluate the mean and standard deviation 
of Ce , we find 



(Ce) 
Ce 



= 1 



'2£ + 1 
~2CT 



ACe 



2 r2 

— <r + 
27 



(A13) 
(A14) 



where 6 = (£ — 1/2) _1 . At £ = 2 the fractional error 
on the mean and standard deviation arc 3% and 4% re- 
spectively; by £ = 10 these have dropped to 0.1% and 
0.6%. It remains to fix the normalisation. Exponenti- 
ating Eq. (A9) and integrating with the approximation 
dCe ~ Sfijdxe, we find that 



^/detMee- \ 



(A15) 



where the proportionality constant is independent of the 
cosmological parameters 6. In the text we further ap- 
proximate the normalisation by A oc 1/ Y[t Ce- 

To generalise our approximate sampling distribution to 
include instrument noise, we assume that the measured 
power spectrum has had a noise bias Ne removed, so that 
(Ce) = Ce still holds. The modifications we suggest are 
then xe = (Ce + Ne) 1 / 3 and replacing Ce by C'e + Ne in 

Ve- 
in Fig. 10 we compare our two new likelihood approx- 
imations with the exact distribution, and also with the 
log-normal distribution proposed in Ref. [34] and the like- 
lihood used in the first-year WMAP analysis [35] , for the 
case of noise- free, full-sky observations of Gaussian fields. 
In this case, the measured power spectrum Ce equals the 
maximum likelihood spectrum Ce <m \, and working from 
the exact sampling distribution for Ce produces the same 
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FIG. 10: (Colour online) Comparison of likelihood approximations for Ci for noise-free, full-sky observations of Gaussian 
fields. Left: the exact distribution (red, solid), Eq. (A4); our new result (blue, dashed) in Eq. (A6) that assumes the peak 
position and curvature there; and our alternative result (green, dotted) in Eq. (A9) that works from the sampling distribution 
of the measured power spectrum, for 1 = 2 (left panel) and t = 10 (right panel). Note that the green dotted line is barely 
distinguishable from the exact distribution. Right: the log-normal distribution (blue, dashed); the WMAP likelihood (green, 
dotted) used in the analysis of the first-year data; and the exact likelihood (red, solid). In all cases e = Ci/Ce, where Ce is the 
measured Ce ; Ce is also the maximum likelihood Ce for the ideal conditions assumed. 



distribution for Ce as working directly from the exact 
P{d\C(). Our new approximations are seen to be more 
accurate than the existing approximations, and particu- 
larly so when the number of degrees of freedom (2£+ 1 in 
this example) is small. Our second approximation, based 
on the sampling distribution of the measured Ce, is seen 



to be more accurate in the tail of the distribution than 
our first approximation that starts from the peak posi- 
tion and curvature. This appears to be because of the 
non-perturbativc way in which the theoretical Ce appear 
in the normalisation of the sampling distribution. 



